
LLM Finetune with PEFT #99

Merged: 21 commits from feature/OSSK-499-llm-finetune-with-peft into main on May 23, 2024

Conversation

@avishniakov (Contributor) commented Apr 12, 2024

This PR brings in a pipeline to run Mistral model fine-tuning with the PEFT library on the ViGGO dataset.
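
For context, PEFT-based LoRA fine-tuning attaches small low-rank adapter matrices to a frozen base model, so only a tiny fraction of the parameters is trained. A minimal sketch of that setup follows; the checkpoint ID and all hyperparameter values here are illustrative assumptions, not necessarily what this project uses:

```python
# Minimal LoRA setup with PEFT. The checkpoint and all hyperparameters
# below are illustrative assumptions, not this project's actual settings.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base_model = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-v0.1")

lora_config = LoraConfig(
    r=8,                                  # rank of the low-rank update matrices
    lora_alpha=16,                        # scaling factor for the updates
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```

The wrapped model can then go through a standard Hugging Face training loop; only the adapter weights receive gradients, which is what keeps memory and compute manageable.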

Key highlights:

  • The project has one pipeline that runs the full cycle of LLM fine-tuning (see the sketch after this list):
    • Data extraction and preparation
    • Fine-tuning the base model on the new data
    • Evaluation of the base and fine-tuned models
    • Promotion of the fine-tuned model to the target environment, if it outperforms both the base model and the currently promoted one
  • The model can be trained in full on a local (or remote) orchestrator
  • The model can be trained with step operators, where only the GPU-intensive steps are pushed to the step operator (we used the Vertex AI step operator)
  • This PR also moves the previous LitGPT fine-tuning project to llm-litgpt-finetuning; the new project becomes llm-lora-finetuning
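
As a rough illustration of how these stages compose into a single pipeline, here is a sketch; the step names and signatures are hypothetical placeholders, not the project's actual code (the real definitions live in the steps/ and pipelines/ modules):

```python
# Hypothetical sketch of the full-cycle pipeline. Step names, signatures,
# and return values are placeholders, not this project's actual code.
from zenml import pipeline, step


@step
def prepare_data() -> str:
    """Extract and prepare the dataset; return the prepared data location."""
    return "data/viggo"


@step
def finetune(data_dir: str) -> str:
    """Fine-tune the base model with a PEFT adapter; return the model location."""
    return "model/finetuned"


@step
def evaluate_models(data_dir: str, ft_model_dir: str) -> float:
    """Score the base and fine-tuned models; return the fine-tuned model's score."""
    return 0.0


@step
def promote(score: float) -> None:
    """Promote the fine-tuned model if it beats the base and the incumbent."""


@pipeline
def llm_peft_finetuning():
    data_dir = prepare_data()
    ft_model_dir = finetune(data_dir)
    score = evaluate_models(data_dir, ft_model_dir)
    promote(score)
```

In the step-operator variant, only a GPU-intensive step like finetune would be configured to run on the remote step operator (e.g. Vertex AI), while the lightweight steps stay on the orchestrator.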

P.S. The diff is a bit off due to the project move; the LitGPT project was a straight lift-and-shift with no changes.

Template update: zenml-io/template-llm-finetuning#4

dagshub bot commented Apr 12, 2024

@strickvl (Contributor)

@coderabbitai review

@strickvl added the enhancement and internal labels on Apr 12, 2024
@strickvl (Contributor) left a comment

Initial quick read. Looks good. Will read more thoroughly on Monday. Nice work on this!

Review threads:
  • llm-peft-finetune.tar.gz (outdated, resolved)
  • llm-lora-finetuning/README.md (outdated, resolved)
  • llm-lora-finetuning/run.py (outdated, resolved)
  • llm-lora-finetuning/steps/evaluate_model.py (resolved)
  • llm-lora-finetuning/utils/loaders.py (outdated, resolved)

@strickvl (Contributor) left a comment

Not much else to say beyond these comments (and the ones from before). Looks good overall.

Review threads:
  • llm-lora-finetuning/requirements.txt (resolved)
  • llm-lora-finetuning/utils/callbacks.py (resolved)

@avishniakov requested a review from strickvl on April 15, 2024 08:57

@schustmi (Contributor) left a comment

Just a few tiny nits, lgtm otherwise

Review threads:
  • llm-lora-finetuning/utils/logging.py (outdated, resolved)
  • llm-lora-finetuning/pipelines/train.py (two threads; outdated, resolved)

@avishniakov requested a review from schustmi on April 16, 2024 08:41

@avishniakov (Contributor, PR author)

@htahir1 are we OK to merge this? It will replace the current LoRA example with the PEFT one, and LitGPT will be moved to a new directory. Any blockers?

@schustmi (Contributor)

Just one more thing that came to mind: we should probably also rename the llm-lora-finetuning template, right?

safoinme added a commit that referenced this pull request Apr 25, 2024
safoinme added a commit that referenced this pull request May 8, 2024
@avishniakov merged commit 5fc690d into main on May 23, 2024 (1 of 3 checks passed)
@avishniakov deleted the feature/OSSK-499-llm-finetune-with-peft branch on May 23, 2024 14:42

Labels: enhancement, internal
4 participants